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Abstract 

Network control refers to a very large and diverse set of problems including controllability of linear time-invariant dynamical 
systems, where the objective is to select an appropriate input to steer the network to a desired state. There are many notions of 
controllability, one of them being structural controllability, which is intimately connected to finding maximum matchings on the 
underlying network topology. In this work, we study fast, scalable algorithms for finding maximum matchings for a large class 
of random networks. First, we illustrate that degree distribution random networks are realistic models for real networks in terms 
of structural controllability. Subsequently, we analyze a popular, fast and practical heuristic due to Karp and Sipser as well as a 
simplification of it. For both heuristics, we establish asymptotic optimality and provide results concerning the asymptotic size of 
maximum matchings for an extensive class of random networks. 

Index Terms 

Maximum Matching, Karp-Sipser, Structural Controllability, Network Control, Random Networks. 


I. Introduction 

N ETWORKS are capable of capturing relationships between a set of entities (vertices) and have found applications in 
diverse scientific fields including biology, engineering, economics and the social sciences in, 0 , 0 - Network control 
refers to a very large and diverse set of problems that involve control actions over a network (see for example, a, s, 0 , 
0 , 0 , 0 , lUOl . iflTl . Ifl2l . H~3l and references therein). 

A class of control problems involves dynamical systems evolving over time that have inputs and outputs and many results 
exist for systems that exhibit linear and time-invariant dynamics HU- One particular notion of control is that of structural 
controllability 0, ED, which was recently explored by Liu et al. ED. Under this notion, the structural controllability problem 


reduces to find maximum matchings on appropriate matrices as reviewed in Section I-A The problem of obtaining maximum 
matchings has been extensively studied in the computer science literature both for deterministic ED as well as random networks 
ESI. However, the focus in the literature has been on special classes of undirected random networks on, m. For example, 
using the results of Tao and Vu ED, one can infer that for dense Erdos-Renyi random networks, a single controller is sufficient 
to ensure controllability. Little is known about the performance of matching algorithms in other interesting classes of random 
networks Go), with an exception of the recent work of Balister and Gerke GD- We focus on degree distribution random 
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network models, and establish results on the minimum number of controllers needed to guarantee structural controllability. We 
also show that these models are realistic representations for real world applications. 

A popular, fast and practical algorithm for matchings on undirected random networks is due to Karp and Sipser ll22l . which 
represents the cornerstone of our theoretical investigations and through it we provide generalizations of previous work in the 
literature to broader classes of undirected random networks. Further, we also extend the results for directed variants of the 
same classes of random networks. 

A. Structural Controllability and Maximum Matchings 

Next, we review some key concepts in structural controllability for linear dynamical systems. Consider a system described 
by a n-dimensional state vector x(t) = (x\(t), ...,x n (t.)) T £ R", whose dynamical evolution is described by 

— = Axlt ) + Bu(t), 
at 

where A £ R nx ” is the system transition matrix, u(t) = (u\ (t),..., ttfc(f)) T £ R fc captures control actions and B £ M. nxk 
is the input matrix. Assuming that the n-dimensional system can be represented by vertices on a network G = ( V , E) with 
V = {1,2,..., n } denoting the set of vertices and E C V x V the set of edges, it can be seen that the non-zero entries in the 
transition matrix A correspond to the directed edges in E. Indeed, for i,j £ V the edge i —> j exists if and only if vertex i 
influences vertex j, i.e. A lt ^ 0. Such a system is called controllable if for any initial state x(0) = xq and any desired state 
Xd for some T < oo one can find an input matrix B and control vectors {u(t) }o<t<T so that the system reaches state x,i i.e. 
x(T) = Xd- The minimum k for which system can be controllable is called the minimum number of controllers. 

The magnitude of the entries in the transition matrix A captures the interaction strength between the vertices in the network; 
for example, the traffic on individual communication links in a communications network or the strength of a regulatory 
interaction in a biological network. The time invariant matrix B indicates which vertices are controlled by an outside controller. 
Hence, the set of vertices that when applying controllers to them makes the system controllable needs to be identified. 

Note that we don’t assume any constraint on the number of nonzero elements in the columns of input matrix B, i.e. one 
controller Ui(t) can influence multiple vertices. The case where every controller can be applied to only one vertex in the 
network (which in turn implies that the goal is to find the minimum number of vertices the controllers can be applied to) is 
studied by Olshevsky l23l . He shows that finding the exact solution to the problem is NP hard. 

The algebraic criterion to check controllability of a time invariant linear dynamical system is Kalman’s controllability rank 
condition, that states that controllability can be achieved, if and only if the matrix C = [B, AB ,..., A n ~ 1 B] is full rank; i.e. 
rank(C) = n. Note that C £ R nxnk . This algebraic criterion is computationally hard to check, especially for large systems. 
Further, in many applications, obtaining exact values of A may not be feasible and hence a tractable alternative is needed. 

Thus, we say that a time invariant linear dynamical system is structurally controllable , if it is possible to select the non-zero 
values of A, B, so that Kalman’s rank condition is satisfied JT|. A structurally controllable system is controllable for almost all 
A, /i; i.e. the pathological cases for which a structurally controllable network is not controllable has zero Lebesgue measure. 
The relation between the minimum number of controllers needed to structurally control a network and the size of its maximum 
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matching has been presented in several forms (see (24)). The version used in the current work is the one appearing in Liu et al. 
m “minimum inputs theorem” (stated below). Moreover, similar equivalence between structural controllability and maximum 
matching when every controller can be applied to only one vertex, is studied by Assadi et al. l25l . 

Furthermore, Commault and Dion (26) study the problem of using only a single controller applied to as few vertices as 
possible. More general results regarding the control configuration selection can be found in the work of Pequito et al. G3 
which discusses relations to maximum matching problem as well. Other considerations must be taken into account for deciding 
which approach is most suitable for the practical application under consideration. Finally, we have no assumption regarding 
self-loops in the network. Cowan et al. Il28l study networks, where every vertex influences itself. 

Algorithms to find a maximum matching are well studied in the literature and exhibit polynomial time complexity (with 
respect to the size of the network). A popular one developed by Micali and Vazirani |[29| has running time O (|L| 0 5 |£’|). 

Next, for completeness we provide a definition of maximum matching and also state the minimum inputs theorem. 

Definition 1. For a directed network G = ( V , E), a subset of edges M is a matching, if no two edges in M share a common 
starting or a common ending vertex. A maximum matching corresponds to a matching of maximum size. 

Definition 2. Given a matching M for directed network G = ( V. E), a vertex is matched, if it is an ending vertex of an edge 
in the matching M. Otherwise, it is unmatched. 

Minimum Inputs Theorem [12): Let M* be a maximum matching of the network G = (V, E). The minimum number 
of controllers needed for structural controllability of the netw’ork is max{l,n — |M*|}. Moreover, for any matching M, the 
network is struturally controllable using max{l,n— |Af|} controllers. 

The upshot of this result is that in order to find the minimum number of required controllers for structural controllability we 
can equivalently find the size of a maximum matching. Furthermore, one can explicitly find the structure of the input matrix 
B according to the proof of minimum inputs theorem. In fact, one can see that a matching is formed of a set of directed loops 
and a set of directed paths. The first vertex of each path is unmatched, while all other vertices are matched. According to 
Lin’s work 0], one controller is used to actuate every unmatched vertex. Amongst matched vertices, some might need to be 
actuated by a controller, but no new controller is needed, as it suffices to apply any of the previously used controllers to one 
arbitrary vertex of each loop. So the number of nonzero components in B £ R nxfc , or equivalently the total number of the 
connections between the inputs and vertices in the network, is at most n. More details are provided by Liu et al. (Supplementary 
Information of itTZI l. 

As the problem of finding the minimum number of controllers needed for structural controllability of a network reduces to 
maximum matching type of problems, henceforth we use the (minimum) number of unmatched vertices and the (minimum) 
number of controllers interchangeably. In this work, we provide results about the size of matchings obtained by different fast 
algorithms for classes of random networks. In fact, Karp and Sipser 822) proved that, for the classical undirected Erdos-Renyi 
random network, the KS algorithm is optimal. We generalize their results to a larger class of random networks. The remainder 
of the paper is organized as follows. In Section [n] we introduce different classes of random networks subsequently studied in 
this work. Furthermore, some probabilistic results needed for technical developments are summarized. The main algorithms 
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studied are introduced in Section [III] and connections to real networks using some numerical examples are provided in Section 
[Tvl The key results of the paper are presented in Sections [V] (analysis and optimality of the algorithms). 

Notations: For set S, let 151 be the number of elements in S. ( ) = —, n! —.. For vertices u.v £ V in the network 

G = (V,E), { u,v } ((u,v) or (v,u) in directed or bipartite networks) denotes an edge, so N = M U {{u, u}} means adding 
the edge {u, w} to M gives N. G — {u , i>} means removing vertices u, v (and so all edges connected to them) from network G. 
Further, deg G (v) denotes the degree of v in G i.e. the number of edges in G connected to v. When there is no subscript, the 

n 

network G is identifiable from the context. Finally, for x £ R n , n £ {0,1, 2, ...}U{oo}, ||x||i is norm of x: ||cc||i = \xi\. 

»=l 


II. Random Networks 


In this section, we introduce different classes of random network models and then present some general results about 
convergence and concentration of real-valued functions on networks. In order to have a general framework which includes 
both directed and undirected networks, we note that every undirected network G = ( V , E ) can be considered as a directed 
network in which, for all vertices i,j, both edges i —» j and j —> i exist, if and only if the edge i ££ j exists in the original 
undirected network. All statements presented are true for both directed and undirected networks unless explicitly mentioned. 
For a comprehensive discussion on constructions and properties of (undirected) random networks, see chapter 3 in Durrett 

ED. 

The first model for random networks we consider is the Erdos-Renyi ( ER ) model. A directed network G = (V. E), 
V = {1,2, is (drawn from) ER if every edge i —> j for i , j = 1,2, ...,n is present in the network independently 

with probability pW. Analogously, an undirected network G = (V,E), V = {1,2, ...,n} is ER if every edge i -fA j for 
i,j = 1.2,..., n. i < j is present in the network independently with probability j/ n> . Henceforth, for A £ [0, oo], ER( A) is a 
Erdos-Renyi random network, for which np^ —» A as n —> oo. In ER( A) random networks, the parameter A corresponds to 
the average degree. 

The next model we consider is the Uniform Fixed-Size ( UFS ) model. A directed network G = ( V , E), V = {1,2,..., n\ is 
UFS when the cardinality of the edge set /:’ = k n for some fixed k n , and the k n directed edges are drawn uniformly among 
all n 2 possible edges. The construction for the undirected network is similar, but the k n edges are chosen uniformly among 


all 


i(n+l) 


possible edges. For A £ [0, oo], we denote by UFS( A) a random network of the UFS class, for which fe " 


A for 


directed and ^ —>• ^ for the undirected case, as n —> oo. Once again, the A parameter corresponds to the average degree. 


Finally, we introduce the class of Degree Distribution (DD) random networks. There are a couple of reasons for considering 
this class. First, it lets us consider networks with degree distributions commonly found in real networks that simpler models 
such as ER (where the degree distribution is Poisson) cannot model. For example, as shown in Fig. [T] the degree distribution 
of real world networks can be any arbitrary non-parametric distribution. 

Histograms demonstrating input and output degree distributions of some other real world networks can be found in Fig. [2] 
Second, as empirically shown in Section |IV| the number of controllers for structural controllability of a network is, to a 


large extent, determined by its degree distribution. More details can be found in Section IV as well as the work of Liu et al. 


m. 
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Figure 1: Degree distribution histogram of the social network between prison inmates. For more information see Table [i] (blue: 
input degree, red: output degree). As shown here, it is difficult to model degree distributions of real networks using standard 
parameteric distributions. 


Ieader2inter-st college student 



40 


20 


celegansneural 


ffllliiu, 


10 20 30 40 50 


p2p-Gnutella06 









Figure 2: Degree distribution histogram of some networks. For more information see Table [T] (blue: input degree, red: output 
degree). 


An undirected random network is a member of the DD class, if for a given degree distribution the attachment of edges 
is random. Specifically, let p be a probability distribution with support on the set {0,1,2,...} of nonnegative integers. We 
then construct an undirected network DD(p) = (V. A) as follows. Let V = {1,2,..., n} and for i e V, let vertex i have D t 
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undirected half-edge(s) (one-half of an edge is connected to vertex i). Note that !) \,.... l) n are the corresponding degrees 
which are assumed to be independent and identically distributed (iid) with distribution p; P( I), = k) = p(k). To complete the 

/E Di. 

construction, we then pair all half-edges randomly; i.e. all (.=1 J possible attachments of half-edges have equal probability. 


n 

When the number of half-edges ^2 Di is an even number, the construction is straightforward and the number of edges will 

i= 1 
n 

b e \ 'Yh Di- When it is an odd number, we pair the half-edges randomly to obtain the network and omit the last single half-edge 

2=1 

n 

for which no pairing was established at the end of the construction, so that the number of edges will be Di — 1). Note 

*=1 

that the omission or presence of multiple edges will lead to a difference between D 1 ,..., D n and the actual observed degrees 
deg(l),..., deg(n) once the network construction is completed. However, as Lemma [I] below establishes, the asymptotic 
empirical degree distribution will be the original degree distribution from which the network was constructed, as long as the 
expected value of Di, El), is finite. 


Viewing an undirected network DD(p) as a directed one, both input and output degrees of vertex i are deg(Y). To construct 
a directed DD random network, denoted by DD(pi n ,p out ), with distinct input (deg m ) and output (deg out ) degrees, we do the 
following: once we have iid draws and D ( j °" t> from the input and output degree probability distributions pi n and p out 

respectively: 

P (A (in) = k) = Pin (*0, P(A (OUt) = AO = Pont (AO, 


let vertex i £ V have D[ ln ' 1 directed half-edges pointing into vertex i and D < f ut ' > directed half-edges pointing out from vertex 


i. Next, we pair directed half-edges randomly to have min{ D^ ln \ Df uz> } edges and omit the remaining half-edges. 


-v(oiit) 


2=0 


2=0 


The random pairing of half-edges implies that all I max { D) ln , ^ D f " } )! possible pairings of half-edges are equally 

V i=0 j-o 

likely. Note that D^' n \ c | 0 no t need to be independent. 


Furthermore, in general, the degrees do not need to be iid. In fact, as shown later in the paper, the key asymptotic 
results we establish are based on the empirical degree distributions which are, by the following lemma, same as the original 
degree distributions when vertex degrees are iid. However, as long as for all k = 0,1,..., lim \i l ^ v - D ^ k )\ (equivalently 


lim |{»eV:D ( - ) =fc}| lim |{jgV r : D)° ut) =fc}| 
n —>oo n ? n— Kx> 71 

distributions. 


) are deterministic, our results hold using the resulting asymptotic empirical degree 


Lemma 1. For an undirected network G = ( V,, E), define the asymptotic empirical degree distribution as 

m = lim 

n—too Tl 

oo 

If G = DD(p) is a random netw’ork and p = ^ kp(k) < oo, then the limit above exists and we have p(k) = p(k) for all 

k= 0 

k = 1,2,.... In general for a network G = (V. E), define the asymptotic empirical input and output degree distributions as 

|{* e V : deg in (i) = k}\ 


Pin(k) = lim 


Pout(k) = lim 


l(* e V : deg out (i) = k}\ 
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If G = DD(pi n ,p out ) z's a random network and p = kpi n (k) = ^ kp ou t(k) < oo, then for all k = 1, 2,... f/ze limits 

k =0 fc=0 

above exist and we have Pi n {k) = Pin{k),p ou t{k ) = Pout(k). 

Henceforth, for all networks by Pi n ,p ou t we mean asymptotic empirical degree distributions Pi n ,p ou t respectively. 

Next, we define the Lipschitz property for real valued functions defined over networks. 

Definition 3. Let f be a real-valued function on the set of directed networks. We say f has the Lipschitz property, if 
|/(Gi) — /(G 2 )| < 1 whenever G\ = (V, Ef), G 2 = ( V , E 2 ), E 2 = E\ U {e}; i.e. the value of f will change at most by 1 if 
one new edge is added to the network. 

Remark 1. Properly defining the norm || ■ || on the networks, one can see the Lipschitz property is the same as the classic 
notion |/(Gi) — /(G 2 )| < ||Gi — G 2 ||. Indeed, for every network G = (V,E), letting -4(G) be the adjacency matrix of G 

, , , , ~ \v\ ' 

(A(G) £ Rl y |x|y| ; f or g y ► j, A(G)ji = 1 and A(G)ji = 0 otherwise), define ||G|| = ||^4(G)||i = \A(G)ij\. 

i,j =1 

Then Gi, G 2 differ by only one edge if and only if\\A{Gi)— _4(G 2 )||i = 1. So f has the Lipschitz property if |/(Gi) — /(G 2 )| = 
1 / (^(Gi)) - / (A(G 2 )) I < M(G0 - -A(G 2 )|U. 


Next, we present convergence and concentration inequalities for functions of random networks which have the Lipschitz 
property. Specifically, the number of unmatched vertices (or equivalently the number of controllers) obtained by a matching 
algorithm, has the Lipschitz property as shown in Section [V] The consequence of the Lipschitz property for a function defined 
on a “not too dense” random network, is that it concentrates around its expected value. Further, if the average degree of the 
network is finite, then the concentration occurs exponentially fast. These results are summarized in the following theorem. 


Theorem 1 (Convergence of real-valued functions for random networks). Let G = ( V ., E) belong to the ER, UFS, DD(pi n ,p out ) 
or DD{p) class. For a real-valued function f which has the Lipschitz property, if 


lim sup 

n—f 00 


EM 

n 2 


= 0, 


then -G G ) e(/(g)) _^ p q as n ^ 00 . If in addition, 


lim s up E (|fi|) 1 °g’» =o | 
n—>00 r ri 


then e(/(G)) q a g as n qq. When 

such that for every 0 < e < 1 : 


sup 

n> 1 


n 


< 00 the rate of convergence is exponential; i.e. there is C > 0 


|/(G)-E(/(G))| 

n 


> e) < 2exp(— nCe 2 ). 


III. Matching Algorithms 

Before studying the algorithms, we provide a description of networks which will be useful later. As mentioned before, every 
undirected network can be considered as a directed one. Now to have a better understanding of how the algorithms work we 
view every directed network as a bipartite network G = (L, R, E) where L = R = V, E C L x R, L and R are respectively 
the left and the right side of the bipartite network, and for l £ L,r £ R there is an edge (/. r) £ E if and only if in the original 
directed network there is an edge from l to r: l —> r. Henceforth we will only deal with bipartite networks. 








Matching algorithms take a network as input and produce a matching as output. Maximum matching algorithms will give 
a matching of maximum size. Algorithm |T| is the well known Greedy Algorithm that produces a suboptimal matching Mg in 
general. For an arbitrarily chosen vertex v on the right side. Greedy tries to find an arbitrary vertex u on the left side which 
is connected to v and has not been used before by any other vertex on the right side. Note that as mentioned above, networks 
can be assumed to be bipartite. Moreover, clearly the time complexity of Greedy is 0(n). 


Algorithm 1 : Greedy 

Input: G = (L, R, E ) 

Output: matching Mq{G) 

Mg 0 

while E ^ 0 do 

let v £ R 

if deg(zz) = 0 then 
G^-G-{v} 

else if for zt G L, (zt, v) € E then 
G<^G-{u,v} 

Mq ■£- Mq U {(it, u)} 

end if 
end while 
return Mq 


Note that Greedy picks an arbitrary vertex v € R in every iteration. Because the goal is to find a matching of largest 
possible size, this strategy for picking a vertex can be improved. First note that for every vertex v £ R of degree one, there 
is a matching of maximum size in which v is matched. The logic is as follows. Let zt £ L be the vertex on the left side 
connected to v, (zt, v) £ E. There must exist a vertex on the right side, w £ R, such that (zt, w) £ E, such that w is matched 
to zt by a matching M of maximum size; since, if not, adding (zt, w) to it leads to a matching of larger size. Now defining a 
new matching M' which is exactly M with ( u,w ) removed and (u,v) added, i.e. M' = MU {(zt,z;)} — {(zt,zx;)}, we have 
\M\ = M' i.e. M' is a maximum matching as well. Hence as long as we can find a vertex of degree one, we can find 
a matching of exactly maximum size. In other words: no mistake occurs as long as a degree one vertex is picked in every 
iteration of Greedy (a mistake occurs if in an iteration, the algorithm adds an edge to the matching which is not optimal, i.e. 
leads to a deviation from maximum matching). 


Algorithm 2 : Karp-Sipser 

Input: G = (L,R,E) 

Output: matching Mks{G) 

M ks <- 0 

while E ^ 0 do 

let v = are; min degfzn) 
wglur 
if deg(z;) = 0 then 

G^G-{v} 

else if for u £ L U R, {zt, v} £ E then 
G <— G — {zt, u} 

M ks <- M ks U {{zt, zt}} 

end if 
end while 
return Mks 
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This fact is the idea behind Algorithm |2j called the Karp-Sipser Algorithm (KS) l22l . which produces a matching Mks- 
In every iteration of KS, among all vertices a vertex of minimum degree is picked. 

Regarding time complexity of the KS algorithm, note the following connection with the Greedy one: in every iteration, I\S 
picks a vertex of minimum degree and by using a “Heap” data structure, finding a vertex of minimum degree has complexity 
0(1) RTl ; therefore, the running time of KS is linear. Further, since for real networks, the average degree is finite, for each 
iteration of the KS algorithm (i.e. excluding up to o(n) iterations) there are 0{n ) many vertices of degree one, which implies 
that determining a degree one vertex takes on average in 0(1) steps. 

We can simplify the KS algorithm and search for a minimum degree vertex among vertices on only one side to derive 
Algorithm [3] that we call the One-sided Karp-Sipser (OKS) Algorithm and whose output we denote by Moks ■ As we will 
see later in the analysis of real networks, the size of the matching given by OKS is usually less than or equal to the size of 
the matching given by KS. The intuition behind this is as follows. It is possible to make a mistake in OKS because of lack 
of degree one vertices on the right side, but if degree one vertices exist on the left side, I\S can still work optimally. Yet, 
later we will prove (asymptotic) optimality of both algorithms. 


Algorithm 3 : One-Sided Karp-Sipser 

Input: G = (L,R,E) 

Output: matching Moks(G) 

Moks 0 

while E ^ 0 do 

let v = argmin deg(w) 
wGR 

if deg(u) = 0 then 
G^G-{v} 

else if for u € L, (it, v) € E then 
G^G-{u,v} 

M 0 ks M OKS U {(it, u)} 

end if 
end while 
return Moks 


IV. Numerical Examples 

In this section, we present the results of selected numerical analyses on real world networks. We study the number of 
controllers, or equivalently the number of unmatched vertices for 10 different networks. This provides support to studying DD 
random networks as a realistic model for control applications. In other words, using Fig. [4j to find the number of controllers 
needed to structurally control a network, one can assume real networks are in fact DD random networks. 

The full description of these networks can be found in Table |T] 


Table [n] contains the results for 10 networks, including social, internet, web, electronic, neuronal, power grid and tran¬ 
scriptional regulatory networks, enumerated by the first row of the table. The second (third and fourth respectively) row is the 
number of controllers needed for structural controllability of the corresponding network if OKS (KS and Maximum Matching 
respectively) algorithm will be used. 
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number 

network 

type 

M 

\E\ 

description 

1 

leader2inter-st-college- 
student lf32l 

Social 

32 

96 

directed 

2 

celegansneural t~33l 

Neuronal 

297 

2345 

directed 

3 

p2p-Gnutella06 j34j 

Internet 

8717 

31525 

directed 

4 

polblogs (351 

WWW 

1490 

19025 

directed 

5 

prisoninter-st-prison- 
inmates ll32l 

Social 

67 

182 

directed 

6 

s208-st J36j 

Electronic Circuits 

122 

189 

directed 

7 

s420-st {361 

Electronic Circuits 

252 

399 

directed 

8 

s838-st |36) 

Electronic Circuits 

512 

819 

directed 

9 

USpowergrid-n4941 031 

Power Grid 

4941 

13188 

undirected 

10 

yeastinter-st 1361 

Transcriptional Regulatory 

688 

1079 

directed 


Table I: Real networks used in this paper 



Networks 

Algorithms 

1 

2 

3 

4 

5 

6 

7 

8 

9 

10 

\V\ - \Moks\ 

6 

51 

5033 

703 

10 

29 

59 

119 

610 

565 

\V\ - \M ks \ 

6 

49 

5033 

702 

9 

29 

59 

119 

577 

565 

* 

6 

49 

5033 

702 

9 

29 

59 

119 

575 

565 

average of \V | — \Mqks\ for 
random network 

5.6 

29.8 

5006.8 

616.5 

10.1 

24.7 

48.6 

100.5 

454.9 

557.7 

\V\ = n 

32 

297 

8717 

1490 

67 

122 

252 

512 

4941 

688 


Table II: The number of controllers given by different algorithms for different networks as well as the average number of 
controllers for equivalent degree distribution random network and the size of the networks. 


The fifth row shows the average number of controllers for a random network generated by degree distribution model for 
random networks, i.e. input and output degrees of all vertices are the same as the original real-world network but the attachment 
of half-edges is random. This random attachment is based on a simple fact that every random permutation is superposition of 
large enough number of random swaps. Finally, the last row of the table shows the size of the networks. As seen in the table, 
OKS and KS perform very close to Maximum Matching for all networks. Moreover, network 9 is the only one for which 
the performance of OKS is significantly different than KS. We suspect that this is because network 9 is the only undirected 
network. 

The numerical results in Table [11] can be understood better by the following figures. Fig. [3] shows the performance of 
OKS and KS versus Maximum Matching, i.e. rows 2,3 of the Table [II] versus row 4. The similarity in performance between 
OKS , KS and maximum matching algorithms is better depicted in Fig. [3] 

The number of controllers for equivalent degree distribution random network versus the original network, i.e. row 5 of the 
Table [II] versus row 4 is shown in Fig. [4] According to this plot, degree distribution random networks are sufficiently realistic 
models for real-world networks in terms of the number of controllers needed for structural controllability. 

V. Matching Algorithms in Random Networks 

In this section, we present results about the asymptotic size of matchings produced by the algorithms presented above. 
We first consider the general case where the asymptotic degree distribution of the random network is any arbitrary degree 
distribution with finite mean. Then, more detailed results regarding the special case for a Poisson asymptotic empirical degree 
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Figure 3: The number of controllers given by OKS ( o ) and KS ( x ) versus the number of controllers given by Maximum 
Matching for 10 different real networks. 



Figure 4: The number of controllers for randomly rewired networks versus the number of controllers for the original networks. 

distribution will be provided. Even though as seen before, the latter case is not common among real networks, because of 
classical interests on ER and UFS random networks and the KS algorithm, as well as comprehensiveness, we also study it. 

Before proceeding with the analysis of matching algorithms in an extensive class of random networks, we must ensure 
that the size of the matchings provided by either Greedy, KS, OKS algorithms or any maximum matching algorithm has the 
Lipschitz property in order to have convergence of j \ m oks(G)\ an( j Iff J G) I for random network G where 

M*(G ) is a maximum matching of network G. The following lemma establishes the desired Lipschitz property. The size of 
the matching provided by any of the above algorithms has the Lipschitz property due to the recursive nature of the algorithms. 
The Lipschitz property for the size of maximum matchings comes from their maximality regardless of the algorithm used to 
obtain the maximum matching (this follows easily from the definition of maximum matching). 

Lemma 2. The real-valued functions \Mq\,\Mks\> \Moks\ an d \M*\ which are the size of matchings provided by Greedy, 
KS, OKS and maximum matching algorithms, respectively, have the Lipschitz property. 

A. Arbitrary Degree Distribution 

We establish the optimality of OKS algorithm which immediately yields optimality of KS as well, for reasons explained 
before. For this purpose, we follow in the footsteps of Karp and Sipser l22l and embed the dynamics of both input and output 
degree sequences as the algorithm proceeds in continuous time. This embedding provides differential equations governing the 
degree sequence vectors. However, in the general degree distribution case, unlike the classic Erdos-Renyi case, the differential 
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equations are defined in arbitrarily high dimensions. So there is little hope of working in fixed small dimension as Karp and 
Sipser ll22l did (their differential equations were 3 dimensional) and new ideas are needed. The key idea in our proof is to 
use the differential equations to show that the number of iterations when there is no degree one vertex (and so the algorithm 
can possibly make a mistake) is sublinear (in n) which means the fraction of unmatched vertices (or equivalently the relative 
size of the number of controllers) given by OKS is asymptotically the same as that of maximum matching. Finally, a set of 
equations for the relative size of maximum matching according to asymptotic empirical input and output degree distributions 
is provided. 


Theorem 2 (Asymptotic optimality of OKS algorithm). For network G = (L, R, E),\R\ = \L\ = n let \Moks{G)\ and 
\M*(G)\ be the size of matching given by OKS algorithm and the size of maximum matching respectively. Let G be either 

. . oo oo 

ER, UFS or DD random network with finite average degree, i.e. lim - — = V] ipi n (i) = V] L Pout (*) < oo (where Pi n ,p 0 ut 

n—>oo n 

are asymptotic empirical degree distributions). Then 


2 = 0 


2=0 


llm i-'wioi = llm m 


Remark 2. Under the assumptions of Theorem 2 one can show the asymptotic optimality of KS as well: lim l Mg s( G )l = 

n—too n 

lim om jf f[ le proof here as it is very similar to the proof of The oremhl 

n—ioo n 


Note that in Theorem |TJ letting e = n~ r for every r > \ the convergence holds. So the difference between \Moks\ and 
\M*\ is 0(^/n). Now the following questions arise: (*) what is the size of maximum matching? ( ii ) how can we compute the 
answer (asymptotically) without running the algorithm? The following theorem gives the size of maximum matching in terms 
of input and output degree distributions. For u £ [0,1) define moment generating functions: 

OO OO 

*&in (tt) — ^ ^ Pin ^out (tt) — ^ ' Pout ; 

i— 0 i= 0 

: / \ 1 / x \T^ ipin{i) i -1 

i=l ^ 

1 ( \ 1*/ , \ ST' ip out {i) i -1 

^ i=l V 

00 00 

where p= iPin{i) = ipoutii) is the average degree. 

2=0 2=0 


Theorem 3 (Size of Maximum Matching). For (either ER, UFS or DD) random network G = (L,R,E), |i?| = \L\ = n if 
p < oo let U* be 


U* 


- <b in ( 1 -W i) + <& in (w 2 ) + $ ou t(l - W 3 ) - 2 
$ out{w4 ) + p (w 3 (l - W 2 ) + Wi(1 - W 4 )) 


(1) 
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where (wi,W 2 ,W 3 ,wf) £ [0, l) 4 is the smallest solution of 


</w(l - w 3 ) = 1 - W 2 , f ining) = W3, 


t (l - Wi) = 1 - Wi , (jfoutim) = W 1, 


then the asymptotic fraction of unmatched vertices is U*: 


n—>oo Ti 

Note that the formula |T]i hrst appeared in Liu et al.’s work lfl2ll and was supported by numerical experiments. The result 
above formally proves its validity. 


B. Poisson Degree Distribution 

We first present selected asymptotic and non-asymptotic results about the fraction of matched vertices for the matching given 
by Greedy. Subsequently we generalize the results provided by Karp and Sipser ll22l on the performance of KS. 

Studying a simple algorithm (Greedy) on a simple random network G (drawn from directed ER where every edge i —> j 
exists with probability p ) allows us to obtain explicitly the non-asymptotic probability mass function for \Mq(G)\. The following 
theorem also provides the asymptotic behavior of \Mq{G)\ for directed ER. 


Theorem 4 (Greedy for directed ER). Let the network G be directed ER of size n. Then: 

P(|M g (G)| =n-k) = 


Un(q) 2 


\Mg(G)\ 


= 0. If A = oo then 


a k (q) 2 a n -k(q) 

i 

where q = 1 — p and a.i(q) = II (1 — q 3 ). For ER{\) (i.e. np —> A) if A = 0 then lim 

j—l n-¥ oo 

lim = 1. For A £ (0,oo), \Mq(G)\ is asymptotically normal: 

n—>■ oo n 

f X — log(2 — e -A ) 1 
-A-’ n 4A 

Some of the results in Theorem [4] remain valid for a larger class of random networks including directed and undirected 
networks. 


Theorem 5 (Greedy for asymptotically Poisson degree distributions). Assume G is one of ER(X), IIFS(X'), DD(p) and 
DD(p,p) where probability distribution p is Poisson(A). If X = 0 (respectively X = oo) then lim = 0 (respespectively 

n—>oo n 

1). For X £ (0, oo), lim ^ Mg ^ g ^ = 1 — Iog ( 2 ~ e —!. 

n—> oo n A 

Similar results hold for KS. In fact, Karp and Sipser lf22l proved that, for the classical undirected Erdos-Renyi random 
network (denoted by undirected ER( A), A £ (0, oo) here), KS is optimal. They split the running of the algorithm into two 
phases. Phase 1 begins when the algorithms starts and finishes the first time there is no vertex of degree one in the network, 
when phase 2 starts and proceeds until the algorithm removes all edges from the network. For network G let (7(G), Ut (G) 
and [/ 2 (G) be the number of vertices left unmatched (became of degree zero before being removed from the network) when 
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running maximum matching, phase 1 and phase 2 respectively, H = (V'. E') be the remaining network at the beginning of 
phase 2. Hence \M*(G)\ = n — U(G ), \Mks{G)\ = n — U\(G) — [/ 2 (G), \Mks{H)\ = \V'\ — [/ 2 (G). Since there is no 
deviation from maximum matching as long as vertices of degree one exists, we have U\(G) < U(G) < U-\ (G) + 172(G). Karp 
and Sipser show U2 ^ —> 0 as n —> 00 so the algorithm is optimal, i.e. lim \ Mk s(G)\ = li m 1 AM 211 Further, they show 

Ul ^ —► k( A) and —► h( A) and find functions k,h as k( A) = 7 * +7 A +7 * 7 - l,h(X) = ^ 1 ~ 7 *- ) ^ 7 where 7* is the 

smallest root of 7 = Aexp(—Ae -7 ) and 7* = Ae -7 *. For A < e we have h( A) = 0 because of 7* = 7*. In the following 
theorem, we generalize these results to a larger class of random networks. 

Theorem 6 (KS for asymptotically Poisson degree distributions). Assume G is one of ER(A), UFS(X), DD(p) and I)l)(p. p) 
where probability distribution p is Poisson(A). If A = 0 then lim = li m = 0. If A = 00 then 

n—>00 n n—>00 n 

\m K s(G)\ _ I M (G)| _ j p or ^ g ( 0 , 00 ), we have 

n—>00 n n—>00 n 

lim = lim KM = , _ k[X) . 

n—f 00 fl n—f 00 Tl 

Furthermore, —>• 0 , khJffl k(X) and — > h( A) as n ^ 00 . 

Remark 3. Note that the results in Theorem [6] are consistent with Theorem^as follows. Letting Pi n ,Pout be Poisson(A), 
calculations show U* = k( A) because of p = X, &i n (u) = & 0 ut{u) = fin(u) = <p 0 ut{u) = exp (A (u — 1)). 

Results presented in Theorems [5] and |6] are based on the fact that in all mentioned random networks, the asymptotic empirical 
degree distribution is Poisson. 

VI. Conclusion 

We proved that the OKS algorithm is (asymptotically) optimal for determining a set of vertices where controllers should be 
applied to. Indeed, first benefiting from the connection between structural controllability of networks and maximum matching 
problems (minimum inputs theorem) we introduced simple fast matching algorithms OKS and KS. Further, using topologies 
extracted from real networks, we empirically showed that the minimum number of controllers for structural controllability 
heavily depends on the degree distribution of the network, which in turn implies that the assumption of a random network 
with specified degree distribution is reasonable for many real world networks. Finally, new proof techniques introduced in this 
study enable the rigorous analysis of the the performance of a class of fast matching algorithms for random networks. 

Ruths and Ruths 137) showed that existing random network models, while capturing the key features for predicting the 
minimum number of controllers, are not predicting more detailed control profiles of real networks. This calls for the development 
of new random network models that match control profiles, and associated fast control algorithms. 

Appendix A 
Proof of LemmaQ] 

OO 

For an arbitrary e > 0, let N be large enough such that kp(k) < |. Further, for random network G = (V,E),\V\ = n, 

k=N+l 

remove some edges from G in order to have no vertex of degree larger than N to get network G’ = (V. E’). Now for every 
vertex i £ V there are possibly two reasons for the difference between I), and deg G /(*): 
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• the set of edges we removed from G to get G' 

• multiple edges in the network G' 


so, 


\{ieV : Di = k}\ -\{ieV : deg G ,(i) = k}\ < \E\ - \E'\ + X l Mi ^o 


( 2 ) 


i= 1 


where Mi is the number of multiple edges in G' connected to vertex i. There are at most ( 9 ) pairs of half-edges connected 
to vertex i and for every two of them the probability of the outcome Aj that they both are connected to vertex j is at most 
im- where D = 2\E'\. So, 

V 2 / 


/Af\ U ( N ) 

m *<{ 2 JXx - p (^i £>) -(f)- 


By Markov’s inequality for any S > 0 we have: 


1 n \ 1 n ^ n 1 n 

- x: Wo > SD ) < - s P(M< ^ 0|D) = > l \D)< -,Y.n M i\D) < 


2n( 


nS 


nS 


S(D - l) 2 ' 


But lim = u and 

„ . __ n. > 


lim ads < 1 
n—too n 2 


(3) 


(since ^ kp(k) < § ) imply lim 

fc=iV+l " 7l - >oc 


D—l 


> fj, — e i.e. 


1 


— X Im^o a>p 0 

71 Z ' 


Z=1 


On the other hand, for all k = 1,2,... by the Law of Large Numbers: 

|{»67: D i =k}\ 


lim 

n—> oo 


= p(fc) 


(4) 


(5) 


Finally, because the only reason for the difference between deg G («) and deg G , (i) is the set of edges we removed from G to 
get G' we have: 


|{* € V : deg G (i) = k}\ - |{* £ V : deg G ,(i) = k}\ < \E\ - \E’ 


( 6 ) 


Putting <0 0. 0. 0 all together: 


lim P 

n—too 


\{i&V : deg G (i) = k}\ 


- p(k) 


>e =0 


Since the function f(G) = |{i £ V : deg G (i) = k}\ has the Lipschitz property by Theorem[l] a.s. convergence holds as well 
i.e. p(k) = p(k) 
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For DD(pi n ,p out ), the changes in the asymptotic empirical degree distributions due to omission of 

n n n n 

max{^ D\ m \ Y, D i° Ut) I ^ min{]T E D i° Ut) > 

2—0 2=0 2=0 2=0 

additional half-edges is at most ^ ^ q because by the Law of Large Numbers lim P = 

n i=0 i=0 n->oo n i=0 

n , , 

lim - V £) ■ “ = fi as n 4 oo. The contribution of multiple edges is asymptotically zero similarly. 

n—, KX) n ^_Q 

Appendix B 
Proof of TheoremQ] 

To prove convergence and concentration inequalities for real-valued functions for random networks we use some classical 
notions of probability such as martingale difference sequences. Rhee [38 , Theorem 1] presents a concentration inequality for 
martingale difference sequences. Here we use a slightly more general version of it. The proof presented by Rhee f38l is valid 
by following the same line of reasoning. 

k 

Theorem 7. Let X,;, i = 1, 2,..., k be a martingale difference sequence. If max HXjHoo < M < oo and E(X l 2 |J r ,_i) < 

\<i<k ' j=i 

a 2 < oo then for all t > 0 

PdXA*! >i) <2exp(-^p(^?)) 

i—1 

where p(x) = (1 + x) log(l + x) — x for x > 0. 


We prove the following stronger theorems: 

Theorem 8. Assume f has the Lipschitz property and G = ( V,E ) is a random network in which V = {1, 2,... ,n} and for 
i,j £ V, the edge i —> j exists independently with probability If 

n 

lim sup Pit '* ( X “ Pit ') = 0 

n—f oo Ti 

i,3 =1 

then 0 as n —> oo. Furthermore, if 

lim sup 'jh p$ (1 - pijf ) = 0 (8) 

n—foo . . . 

f/zeu LG) E(/(G')) q a g n o0 _ f/ !e average degree is finite (e.g. ER(X) for X < oo) the rate of convergence 

is exponential. In general given 

sup - pit ( 1 _ Pit ) < c <°° (9) 

n>l 'kl , 

- i,3=l 

we have: 

p( l/( G )-E(/(C))| . 

n C 

Proof: To have more convenient notation we enumerate all possible edges i —> j from 1 to k where k = n 2 for 
directed ER and k = n( ' n + 1 ' > for undirected ER. Indeed, Z,. % = 1,2,...,/;: are independent Bernoulli random variables 
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showing the existence of edges, i.e. if Z* = 1 the corresponding edge exists and if Z, = 0 the corresponding edge does not 
exist. Let Pi,i = 1,2,... be edge existence probabilities, P (Z* = 1), Fo be the trivial sigma-field and for i = 1,..., k let 
Ti = <j(Zi, ..., Zi). Now define a martingale difference sequence as X, = E(/(G)|J r ;) — E(f(G)\Fi-i),i = 1 so 

E(/(G)|J- fc ) = /(G),E(/(G)|J r 0 ) = E(/(G)). Define: 


Ui = E(/(G) Z 1; . 

II 

V r i =E(/(G)|Z 1 ,.. 

II 


Thus, 


E(/(G)|Z l5 ..., £*_!) = Pi Ui + (1 - Pi )Vi 

I (1 -Pi)(Ui-Vi) Z< = 1 
[ - piiUi-Vi ) z, = o 

E(X ? 2 |7i_!) = ft(l - Pi)(Cfi - K) 2 


Since / has the Lipschitz property, |£/j — V*| < M C M = 1 for directed ER and M = 2 for undirected ER): 


max 11 X-i 11 qq < max maxjn;, 1 — ».;) < 1 

l<i<fe l<i<fc 

k k 

y^E(X?|J~i_x) < - Pi) < a 2 < oo 

i=l i—1 

By Theorem [7] for all e > 0 we have: 


|/(G)-E(/(G))| 

n 


> e)< 2exp (~a 2 p(^)) 
a z 


GO) 


lim | implies: 


now if 0 holds then JiG) — y P 0 as n —¥ oo. To show the a.s. convergence, note that ([8]) gives p( l/(G) E(/(G))| > 

n= 1 

e) < oo so by the Borel-Cantelli lemma R G ' I ~^R G '>'> o a.s. as n —> oo. 

Finally using (|9]> we can let a 2 = nC in ([TO]) to obtain: 


|/(G)-E(/(G))| 

n 


> e) < 2exp(-nGp(^)) 


Theorem [8] provides convergence and concentration inequality for ER random networks. A corollary of Theorem [8] is that 


the obtained results are valid for undirected random networks of Chung-Lu type (see page 82 of Durrett l30l ). In a Chung-Lu 
random network the edge between i and j exists with probability “* Wj for the set of weights w i,... ,w n . Conditions 0. 

£ Wk 
fc = 1 
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(|8), <(9> for a Chung-Lu random network will be lim sup = 0, lim sup w " |° g n = 0 and sup w n < oo respectively, where 

n—too n—f oo 

n 

w n is the average weight w n = ± Y w k . 


n> 1 


k =1 


Theorem 9. Assume f has the Lipschitz properrty and G = (V,E), \E\ = k n is UFS. If 


lim sup —^ = 0 


n—foo Tl 


(ii) 


then e(/(G)) q as n 0o _ if furthermore, 


k n log TL 

hm sup- t ,— = 0 


( 12 ) 


then e(/(g)) q a g as n ^ oo. ji ie ra j e of convergence is exponential when the average degree is finite (e.g. UFS(X) 


for X < oo). Namely sup < C < oo implies: 

n> 1 U 


p( 


1/(6?) — E(/(G))| 


> e) < 2 exp(—n—) 


(13) 


Proof: Rhee 11381 Theorem 4] shows 


P(|/(G)-E(/(G))| >f)<2exp(--) 

Kn 

2 _ oo 

If ( fTTj ) holds then letting t = ne we have exp(— |-) —► 0. If ( [T2| > holds then Y P( 

71 n =1 

Borel-Cantelli Lemma (G)) q a g as n 0o _ p 0 show it suffices to let f = ne. 


|/(G)-E(/(G))| 


> e) < oo so by the 


Theorem 10. Let G = (V, E) be DD(pi n ,p out ) or DDfp ) (in the recent case pi n = p ou t = p)- Assuming real-valued function 
f has the Lipschitz property if 


\E\ 

lim sup —tt = 0 

n—f oo Tl 

then e(/(G)) 0 as n —> oo. If in addition 

lims„p |E|1 T ,0 

n—foo Tl 

then F G )~^f(G)) q ag as n —>■ oo. When the average degree is finite i.e. 

\E\ 

sup — < C < oo 

n> 1 Tl 


(14) 


(15) 


(16) 


the rate of convergence is exponential: 




Proof: To form an inequality like ( [T()| let k be the number of half-edges before being paired (so k is the number of 
directed edges - every undirected edge is two directed edges). Enumerate half-edges and for i = 1,2 ,...,k let Z, be the 
random variable indicating the vertex whose half-edge the half-edge i is paired to i.e. Z, = j,j € {1 ,... ,nj if half-edge i 
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is paired to a half-edge of vertex j. Let To be trivial sigma-field and for i = 1^ let T, - o(Z \,..., Z, ). Now define a 
martingale difference sequence as Xi = E(/(G)|J r i) — E(/(G)|J r i_i), i = 1,... , k and Uij, Pij as : 

U ij =E(f(G)\Z 1 ,...,Z i _ 1 ,Z i = j) 

Pij = P [Zi = j\Zi,Zi-x) 


Hence Ui = E(f(G)\Zi,..., Zi_\) = Y2, PijUij and X t = Uij — Ui whenever Zi = j. On the other hand, 

j =i 

n 

nx^Ti-x) = Y; P iAUii - Ui? 

j=1 

Lipschitz property of / implies U, 3 Uu < M ( M < 4 for undirected case and M <2 for directed case). Hence for all 

i = 1, • ■ ■, k: 


Halloo < max || i 11oo < max Pu\\Uij - t/«||oo < M < oo 

l<j<n l<j<n j—^ 

k k n k 

PijM 2 < ^ M 2 < kM 2 < a 2 < oo 
2 = 1 2 = 1 j — 1 2=1 


By Theorem [7] we have: 


n M z a z 


Now ( fl4| . ( p~5] >, ( p~6l > give limsup ^2=0, limsup a ? 2 sn = 0 and sup “ < 16G < oo respectively. 

n—> oo n—too n>l 


Appendix C 
Proof of TheoremQ] 


Letting M^ = n — \Mq[G)\ be the number of unmatched vertices MU) — ?? Mi where 

i—l 



Now note that according to the algorithm 


the vertex picked in *-th iteration is unmatched 
the vertex picked in i-th iteration is matched 


P (Mi = l\Mx, 


i—l 

n— 2+14- ^2 Mj 

■ • ; Mi-x) = q i=1 


n 

For example P= 0) = P(A/i = 0,..., M n = 0) = Yl (1 — qn-i+i^ _ a n (q). We have 

2=1 

P(M< n) =fc)= Y, P(M i = l {ie/} ,Vi = l J ...,n). 

\i\=k 
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If I = {ii, • • • ,h} then 

P (Mi = 1 {ie7} , V* = = (1 - q n ) ... (1 - q n ~ il + 1 )q n ~ il+1 {1 - q n ~ h ) ... (1 - ... (1 - 


a n {q) 


Mq) ) =x 


n ^ ij " 


Thus: 


P(M (n > = fc) = 

afc(g) ^ 


l<ii<...<ifc<n 


But: 


E 


l<zi <...<ifc<n 


n ik~ 1 i2 — l 

E ^ E o 4-1 ■ • • E <7 

ik=k i-k — i —k— 1 2 1 — 1 

" * 3_1 1 _ n il -1 

ik=k 22=2 


23-1 


E«‘‘ 'Ei* 


i k =k 


22 =2 


7 aj 2 -i(g) 

ai(g) a i 2 - 2 {q) 


n ii ~ 1 r, 1 + 2 r 

E^-E? 


ife=fc 


*3=3 
24 — 1 


(l + g)(l-g l3 - 2 )-g(l-g 213 - 5 ) 


= E«“-E 


*)s = fc *3 = 3 


a 2 (g) L 

, 7 +2 «» 3 -i(g) 

a 2 (g) a* a -3(g) 


*(g) |fc(fc+i) 


a k (q)a n _ k (q) 

which establishes the result. Now to show lim = log ( 2 y e —1 for ER(X), for large n let (1 — = exp(— ^j) to have: 

P(M(">=fc + l) i _ g **-fc _ 1 — exp (—^(n — k)) (_ A \ 

P(MW=fc) (1-gfc+t) 29 [i.exp^A^ + i))] 2 P V 7 7' 


Denote f3 k = exp(— ^fc) to obtain: 


P(MW=jfe + l) ^ 2 (l-e- A /3 i : 1 ) 


P(M(") = fc) e« — 2^ + e _ ™/3? ' 


Therefore > 1 if and only if (e _ ™ — l)/3 2 + (e _A — 2)(3 k + e" < 0 i.e. the probability mass function is unimodal 

and as n —> oo, P(A7 n ) = A: + 1) > P (M^ = k) if and only if /Jy 1 < 2 — e _A which is equivalent to k < log ^ 2 y e —7 so 
letting k* = arg max P(A7") = /c) we get P(|M(”) — k*\ > ne) —¥ 0 for all e > 0 as n — > oo i.e. M< —>p log ( 2 y e — 2. By 

0<fc<n 

Theorem [8] a.s. convergence holds as well. To prove the asymptotic normality we will show 

lim 1 ~ P ( M(n) = k W) ^ 

r*^o ° S P(M(") = fc(0)) (7 


where i = lim fc(o) = k* « n/r, p. = IQ8A2 ' e —cr 2 = 7- We have: 

n—>oo v n A 4A 


log(2-e A ) ^2 _ J_ 
A ’ 

n—k(t) 


logP (M^ = k(t)) = V fA-^—) — \nii 2 — \t 2 — 2\^/nyLt 

H ^ 


( 17 ) 
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where the functions fj : 


l,j = 1,2,... are fj(x) = 21og(l — e X ^+R n '> Xx ) — log(l — e x R n ). Note that 


e"*" = (2 — e _A ) _1 . 


( 18 ) 


But for cr,- = e < x and SO me B < oo 


/,-(0) = 21og(l-e- A (' 1+J > ) )-log(l-e-^>) 


2Aa, 


/i(0) = ^ 


m o) = 


—2\ 2 cij 


i/rwi < b - 

Now writing fj(^) = /i(0) + /j(0)^ r + /"(0)|^ + /" , (^)^ r for some t* £ [0,i] and using ([T§, because i(n-fc(f)) 

1 — /i definition of Riemann integral implies: 

“ H /j(°) ->• 2A / gA/j+As _ ! 


da; = 2A/r 


a=i 

z—k(t) 




1-M 


o-A(m+x) 


n 

j =i 

n—k(t) 


(1 — e - A ( p +*))2 


dx = —2A 


1- y /'"(—) < — 


a=i 


i—k(t ) 


i—k(t) 


Note that | i ]T /j (0)—2A/x| < sup and sup ^^ 31 = sup (e ^ e F 1)2 < oo imply X] /j(0)~ 

j=l /x<a;< 1 /i<x< 1 /j,<tc< 1 j=l 

n—A:(0) 

2A/a-y/n —> 0. Now plugging all in <0 we get the desired result since logP(X = k’(0)) = fj( 0) — An/x 2 . 

i=i 

Appendix D 
Proof of Lemma[2] 

To prove the Lipschitz property for maximum matching let G = (L, R, E),G ' = (L, /?. E'), E' = E U {e}, |L| = | R\ - n 
and let M C E' be a maximum matching in G'. If e (j M then M is a maximum matching in G. If e £ M then M — {e} is 
a matching in G so the size of maximum matching is at least |M — {e}\. Thus |M*(G')| — 1 < \M*(G)\ < \M*(G')\. 

For the Greedy algorithm, if n = 1 then clearly \M g {G')\, \M g (G)\ are both either 0 or 1 so \M g (G')\ — \M g (G)\ < 1. 
Assume function \M G \ has the Lipschitz property for all networks of vertex size n — I and define networks H. //' as the 
networks G, G' after the first iteration: H = G — {u, u}, H' = G' — {u r , t/}; u, v! £ L\ v, v' £ R so both H, H' have n — 1 
vertices. 

If u = v! and v = v' then networks H, H' are exactly the same except potentially the new edge e which can be added to 
H in order to get H' . Since H 1 H' both have n — 1 vertices, by the assumption \M G (H') — M G {H)\ < 1. On the other hand, 
Mg(G ) = M g (H) U {(«,«)}, M g (G') = Mg(H') U {(u,i>)} imply \M g (G)\ = \M G [H)\ + 1, |M g (G')| = \M G (H')\ + 1 
i.e. \M g (G') - M g (G)\ < 1. 
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If v ^ v' then the new edge e is connected to v' £ R,u' £ L and dego{v') = 0, degc ( v') = 1. Now there are two possible 
cases: 


1) vertex v! £ L is not used in Mq(G) i.e. there is not any vertex w £ R in Mq{G ) such that v! is matched to w by 

Greedy algorithm applied to G. Therefore iterations of the Greedy algorithm after the first iteration will not change once 
the new edge e is added. In this case Mq{G) = and Mq{G') = Mq{H') U {(z/,z/)}. 

2) vertex u' £ Lis used in Mq(G) i.e. there is a vertex w £ R such that v! is matched to w by Greedy algorithm applied to G : 
()/'. w) £ Mq(G) . Therefore the only iteration of the Greedy algorithm which will change once the new edge e is added 
is the one that vertex w £ R is picked. In this case Mq(G ) = Mq(H')G{{;u', w)} and Mq(G') = Mq(H') U{(V, z/)}. 


In both cases we have \Mg(G')\ — \Mg(G)\ < 1. So \Mg\ has the Lipschitz property. Note that there is no assumption 
about the vertices v, v' picked in the first iteration of the Greedy algorithm. So this proof is valid assuming vertices v. v' are 
of minimum degree, i.e. the proof works for \Mks\ and \Mqks\ as well. 


Appendix E 
Proof of Theorem[2] 

We prove the theorem in two steps. In Step 1, first we embed the dynamics of input and output degree sequences during the 
algorithm in a continuous time Markov process for a random network with bounded degrees. Then, we find explicit expressions 
for differential equations governing the dynamics of degree sequences for infinitely large networks. Then, we show that for a 
finite random network, the dynamics of degree sequences can be approximated by the solution of the presented initial value 
problem. Finally, we show that this solution spends zero time in the cases where there is no vertex of degree one so the number 
of iterations of the algorithm there is no vertex of degree one on the right side of the random network is sublinear w.r.t. the 
size of the network. In Step 2, we generalize the proof to the unbounded, but finite mean, degree sequences. 

Step 1: Embedding in a continuous time Markov process. Assume in addition that the asymptotic empirical degree 
distributions are bounded: Pi n (i ) = p ou t{i) = 0 for i > N i.e. for every vertex v £ L U R we have deg(u) < N. First, we 
embed the dynamics of the algorithm in a continuous time Markov process. To go to continuous time, define G n (t ) as the 
n-vertex network at time t£f where state changes G n = G n — {zt, u} occur at Exp(n) interarrival times. More precisely, let 
... be i.i.d Exp(n) random variables, i.e. the probability density function is ne~ nt for t > 0 so E (r^) = i. The first 
state change G n = G n — {zi, z;} occurs at time t = ti, the second one occurs at t = T\ + r 2 and so forth. Now we construct a 
Markov process on R 2Ar which describes the performance of the algorithm. The transition kernel of the Markov process will 
be described later. Define {t),Y^ n \t) £ R w as: 

Xi n \t) = -|{u e R : deg(u) = k in G n (t)}\, 
n 1 1 

Yu n \t) = - I {v £ L : deg(u) = k in G n {t)} I, 
n 1 1 

for k = 1,2,. ,.,N. In addition let m = m{X^ n ’{t)) be the minimum degree of vertices in R(t)\ m(X( n \t)) = min{fc : 
xj^\t) ^ 0} so letting V\ = v,u\ = u whenever a state change occurs we have deg(z>i) = to. Fet ( Ui,v ) £ E for 
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i = 1,..., m, K = deg(iti) and (it, Vj) £ E for j = 1,..., K. So for a network of size n we have the following conditional 
degree distributions for vertices u\, ..., u rn . v^, ..., vk- 


nkY^ n \t) -kJ2 1 de g (« j )=fc 

P„ (deg(wj) = k\Ai-i(t)) = —^- J -^— 1 -, 

n J2 kY^ n \t) - E deg (uj) 


nkx[ n \t ) - k E 


L deg('U J - 


)=* 


P n (degK) = ^ T -, 

n E kXl n \t) - E deg(ttj) 

fe=i i=l 

where = (deg(ui),..., deg(wj), Y^ (t)),Bi{t) = (deg(tti),..., deg(w*), X^ n \t)). Note that since interarrival times are 
i.i.d exponential random variables, X^ n \t),Y^ n \t) are continuous time Markov processes. 

Letting X^ n \Y^ £ R w be the corresponding vectors after one state change for x,y £ define functions F”,<G n : 


R 2N R N as . 


F"(x, y ) = nE n (X^ - X™ = x, Y {n) = y), 


G n (x,y) =nE„(f (n) -Y (n) \X (n) =x,Y (n) = y), 


where E n is expected value w.r.t P n . Since the process is Markov, probability distribution of X^ n \Y^ depends only on 

x( n ) y( n ). 

Asymptotic initial value problem: Define P(deg(iti) = k) = w fcYfc —EL_ ,P(deg(uj) = k) = ^ Xk ^ . Note that P„,P 

E kY^\t) E kxi n \t) 

k= 1 fc=l 

can be defined for every x,y £ E N with non-negative components. Now, for arbitrary x,y, some algebra gives: 


|P„(deg(ui) = k\Ai-i(t)) - P(degOi) = k )| < —, 

n 


(19) 


|P„(degO, ; ) = k\Bi-\{t)) - P(deg(ui) = k )| < —. 

n 

For Ci = C 2 = Define: 

Z k Vk Z kx k 

= 1 fc = l 

F(x, y) = nE(X( n ) - X^ | X (n ^> = x, F (n) = y), 
G{x, y) = nE(Y <"> - yW |X (n ) = x, E (n) = y), 


where E is expected value w.r.t P. Since 

K 

nX^ ) =n^ ) +Z[ W* j')=k -\-1 ldeg(uj)=fc] 

J=2 

m 

n Ec ^ = «n ( ^+ y~^[ideg(«,-) =fc+l ldeg(tij)=fc] 

i=2 


( 20 ) 


( 21 ) 


(22) 
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inequalities ( fl9| ), ( |20| > yield 

||F"(x, 2/ )-F(x,y)|| 1 <^ A ^ 1 I (23) 

n 

E kx k 

k =i 


II ^ n {x,y) - <G(x,y)||i < 


4iV 4 1 

E ky k n 

k =1 


(24) 


and, 


( ||Ay|!i 1] 11^4*11! ^ SAx Ax ^ lm(x) ’ 

G(x,y) = -^ + r 4^(SAy-Ay), 


\\Ay\\i \\Ay\U 

where A and S are moment matrix and shift matrix respectively, i.e.. A, S £ 


IxN 


, A,j is i for i = j and 0 otherwise, and S t 


is 1 for i = j — 1 and 0 otherwise, m(x) = min{/c : x k ^ 0} and l m £ is the vector in which ?n-th component is 1 and all 
others are 0. Note that ||AX(”)(i)|| 1 = || J 4y( n )(f)||i because for finite n always \E(t)\ = n ^ kX)^’(t) = n E kY^ n (t). 

fe=l fc=l 

Besides, transition kernel of the Markov process can be formulated by JP„ according to ©, ©• 


Approximating the dynamics of the degree sequences by the solution of asymptotic initial value problem: Now we 

can use Kurtz’s Theorem l39l . Given functions F,G : M. 2N —> and positive constant T, define x(t),y(t) : [0, T] —>■ R w as 

the solution of the initial value problems 


x = F(a;,y),a:fe(0) =p in (k),k = 1,. 

••,N , 

(25) 

y = <&{x,y),y k (0) =p ou t(k),k = 1,. 


(26) 


N 

and let £ = {z £ M^such that: e < E kz k < N}. Suppose the following statements hold: 

fc=l 

1) lim sup ||F”(zi,z 2 )-F(zi,^ 2 )||i = 0. 

n ^°° zi,z 2 ef 

2) lim sup ||G"( 2 i, 2 2 ) — <G(.Zi, 2 2 )||i = 0. 

n_> ' 00 Zl,Z 26 f 

3) for all k = 1,..., N , lim X (0) = Pi n (k). 

n—> oo 

4) for all k = 1,... ,N, lim Y fc ( ” ) (0) = p out {k). 

n—too 

5) functions F, G are Lipschitz (in the classic sense). 


Then 


lim P„ (3t £ [0, T] : m{X^ n \t)) ± m(x(t))) = 0. (27) 

n—>■ oo \ / 

N N 

Letting T = T(e) = sup{t : E kx k (t) > e, E k yk(t) > e} for some arbitrary e > 0, the first two conditions are satisfied by 

k =1 k —1 

( |23| l, ( |24| . By the definition of asymptotic empirical degree distributions lim x[ n \o) = Pin(k) and lim y!" 1 (Q) =Pout{k). 

1 1 1 1 n—>oo K n—>oo K 

On the other hand, the initial value problems (|25j), (|26|> have unique solutions since defining metric d on R ;V as d(x. y ) = 
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||x — y||i + 1 m (x)^m(y)^ IF,G are Lipschitz with respect to this metric i.e. there is a B < oo such that for all x,x',y,y' £ 

d(F(x, y),¥(x', y')) < B(d{x , x') + d(y , y')), 
d(G{x, y),G(x', i /)) < B(d(x , x') + d(y, y’)). 


Note that stopping time at T(e), i.e. when en edges are remaining in the network to be removed by the algorithm, will not 
cause any problem since continuing the algorithm from that point on cannot add more than e edges to the matching on a scale 
relative to the number n of vertices. 

Properties of the asymptotic initial value problems: The useful fact about the solutions of < [25| ), ( [26] ) is that Lebesgue 
measure of the set {0 < t < T : m(x{t)) > 1} is zero. Suppose it is not. So there are 0 < t\ < t 2 such that X\(t) = 0 for all 
t £ [ti,t 2 ] so dx ^ = 0 for all t £ (ti,^ 2 )- But 




-(2x 2 (t) - xi{t)) 


dt v \\Ay\h '\\Ax\U 

implies x 2 (t) = 0 for all t £ {ti,t 2 ) which means m(x(t)) > 2 for all t £ {t\,t 2 ). Repeating this argument for x 2 now we 
will get m[x{t)) > 3 and so on. Therefore, m(x(t)) > N which is impossible. Thus, the set {0 < t < T : m(x(t)) > 1} has 
zero Lebesgue measure. 

Sublinearity of the number of iterations of the algorithm with no degree one vertex: Let C {1,2,... , n} be the 

set of indices i of iterations of OKS such that after the /-th iteration the minimum degree on the right side of the network is 
larger than one. Since the set {0 < t < T : m{x(t)) > 1} has zero Lebesgue measure by ( |27j ) Lebesgue measure of the set 
{0 < t < T : m(X( n \t)) > 1} goes to 0 as n grows. Because the Lebesgue measure of the set {0 < t < T : m(X^ n \t)) > 1} 
is T i+i we have lim ^ 77+1 = 0 but by the Law of Large Numbers lim p^yj nT i+i = F(nTi) = 1 . Therefore 


lim 


£JW 
N (n) l 


n_>00 ieJ(») 
(») 




= lim 


Y2 nTi+i = lim Y2 T i +1 = 0 i.e. the number of iterations of OKS algorithm for which 

ieJ M n ^°°iej<.n) 


there is no vertex of degree one on the right side of the network is sublinear w.r.t. the size of the network. 


Sublinearity of difference between the output of OKS and maximum matching: Using lim 


U (n) l 


= 0 we prove that 


the size of the matching provided by the OKS algorithm is away from maximum matching size by a sublinear factor. Starting 
the algorithm, as long as the minimum degree on the right side of the network is one, OKS makes no mistake, i.e. the size of 
the matching by OKS is the same as the size of maximum matching. When the minimum degree is m = rn(X ( ' n> (t)) > 1 it 
is possible that OKS picks a vertex on the left side which is not the optimal choice. We make it optimal by manipulating the 
network: if v £ R, u £ L , deg(u) = m are the chosen vertices in the iteration of the algorithm to be removed from the network, 
Moks = Moks U {(it, d)}, manipulate the network by removing all other m— 1 edges connected to v. Since \M*\ has the 
Lipschitz property, removing these m— 1 edges will change the size of the maximum matching by at most m — 1. Since m is 
the minimum degree and the average degree is bounded, m — 1 is bounded as well. On the other hand, the number of iterations 
that OKS will face such cases is sublinear w.r.t. the size of the network, so the whole number of possible errors, or in other 
words, the whole deviation from maximum matching made by OKS is sublinear, i.e. lim I m °ks(G)\ _ p m I M ( G )l . 

Step 2: Generalization to unbounded degree. Now to generalize the proof to cases where the asymptotic empirical degree 
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distributions are not bounded, we use the classical technique of truncation. For arbitrary e > 0, let N be large enough such 

(X) oo 

that J2 kpin(k) < kp ou t{k ) < |. In random network G remove some edges in order to have no vertex of degree 

k = N+l i = N+l 

larger than N to get random network H which has bounded asymptotic empirical degree distributions. By Step 1, 


lim 

n—>oo 


I Mqks(H)\ 

n 


lim 

n—>oo 


n 


(28) 


Because by Lemma [2] both functions \M*\,\Mqks\ have the Lipschitz property and asymptotically the number of edges 
removed from G to get H is less than ne: 


lim 

n—t oo 


I \M*(H)\ - \M*(G)\\ 


< G 


(29) 


lim \\M° k s(H)\~ \Moks{G)\\ 

n—>oo Tl 


(30) 


Now ( |28| r ( f29] >, ( [30] > imply the desired result. Further, when e —> 0, N — > oo, so formally we can take N = oo and write the 
functions F, G : K°° —> K°° as 


F(a ;,y) 
&(x,y) 


U 2 V Hi 

\\Ay lit 

Ay 

'WMi 


^WMi 

m{x) — 1 

\\ A v\\i 


(S Ax Ax) 1 m{x)i 
■( SAy-Ay ), 


for matrices A, S G R°° X00 


OO 

provided ||A 2 x(0)||i = £ k 2 p in (k) < oo or \\A 2 y(0)\\ 1 


k =1 


oo 

S k 2 Pout{k) < oo. 


fc=l 


Appendix F 

Proof of Theorems |5|6| 


As we saw in the proof of Theorem [2] The asymptotic performance of Greedy, OKS and I\S algorithms can be described 
by asymptotic empirical degree distributions which are solutions of some initial value problems. If we find functions F,G : 
K°° —> R 00 for Greedy and KS we will have 


j|Ax||i _ y 

G( ^> = 


for Greedy and for KS 


F (x,y) = 


%m + Urn 




Urr 


%m + Dm 


Ax m — 1,0. . , 

piib + i|A* (SAl ‘ Aa:) 


<&{x,y) = 


Dm 

%m + Un 


“ 1} " Ay) ~ u 

\\ A v lit 


%m “l - Vn 


Ay nn— 1 

jAyfAUvh 


{SAy-Ay) 
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where m = mm{m(x(t)),m(y(t))} is the minimum degree. Since for KS we have G(a :,y) = F (y,x), when a;(0) = y(0) 
(i.e. p in = p ou t) x(t) = y(t) and 


• wt ■n ( \\ A2x h , _ SAx-Ax Ax l m 

V \\Ax\\i 2\\Axh 2||Ax||i 2 

Thus for any degree distribution p the dynamics of KS is the same for both DD(p) and DD(p,p). Further, regarding the 
results provided in Theorems [5] [6] the relative size of the output of the algorithm as well as the dynamics of the algorithm is the 
same for all random networks ER( A), UFS(X), DD(p) and DD(p,p) where probability distribution p is Poisson(A) because 
they all are sharing the asymptotic empirical degree distributions. So Theorem 4 implies lim ^ Mg ^ g ^ = 1 — los ^ 2 T e —-. For 
KS, all mentioned statements are proved for undirected ER( A) by Karp and Sipser l22l so are valid for the desired class of 
random networks. 


Appendix G 
Proof of Theorem[3] 

OO CO 

Here we assume in addition that ||A 2 :r(0)||i = ^ k 2 pi n (k ) < oo and j|A 2 y(0)||i = ^ k 2 p ou t{k) < oo. Generalization to 

k —1 k =1 

the case where above quantities are not bounded is straightforward similar to Step 2 in the proof of Theorem [2] and is omitted. 
Because lim *- G ^ = lim it suffices to show U* = lim 1 — E f i<s(G)\ . To show the latter claim, we run KS 

n—too n n—too n n—toc n 

algorithm and find the number of vertices left unmatched by the algorithm. Similar to what we did in the proof of Theorem 


[2] the asymptotic fraction of vertices left unmatched by I\S i 


is 


1 — lim 

n—>oo 


\M ks (G)\ 


1 

= Pin ( 0) + J 


A\ A2 y\\i _ ^ + (w - 1 )y„ 
7 || A V ||l X m T Pm 


Xi(t) 

WMi 


dt 


where 


T = sup{f : ^ kx k (t ) > 0, ^2 k Vk{t) > 0} 


k =1 


fc=1 


x = F(a;,y),x(0) = p ir 


V = V(y,x),y{ 0) =p out 


F(x, y) 


' ( \\A 2 y\\i 

: wmu 


1) 


WMi 


(.SAx 


Ax) 1 m 


Um 




+ Vm 


Ax 




and m = min {m(x(t)),m(y(t))} is the minimum degree. Since as we saw in the proof of Theorem [2] the set {t : m(x(t)) > 
1 ,m(y(t)) > 1} is of zero Lebesgue measure without loss of generality in all integrations we can assume m = 1, especially 


Now define: 


1 — lim 


\M K s(G)\ 

n 


T 

= Pin { 0) + J 
0 


xi + yi \\ A v Hi 


i) 


Xi(t) 

||Ax||i 


dt 


X 0 (t) = p in ( 0 ) + 


xi A\ A 2 y(s )lit _ n xi(s) 

X! + yi [ II^OOIl! ’\\Ax\h 


0 
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U(t) = ^ $ in(t,W 2 (t )) + $ in (t, 1 - Wx(t)) + ®out(t,Wi{t)) + $out(t, 1 - W 3 (t)) - 2 

+ p(t) [w 3 (i)(l - w 2 (t)) + wi(t)(l - W 4 ,(t))] 

Since Xl (T) = x 2 {T) = ... = 0, yx(T) = y 2 (T) = ... = 0 we have V(T) = 0, p(T) = 0. But V(0) = 1 - p in ( 0) and for 

TO = 1 

v( f \ = d Mh = Xi [ J^lli ^ x * il yi 

dt xx+yx [ ||^t/||i '||Ae||i J xi+yx 

, \\ A2 yh n x i 1 
xi+yx \\Ay\\x ’WAxWx 

= ~Xq (t) - 1 

i.e. — 1 + Pm(0) = V(T) - V(0) = a; o (0) - x 0 (T)-T = p in { 0) - x 0 (T) - T. Therefore $ in (T,u) = x 0 (T),$ out (T,u) = 
Vo(T),U(T) = 1 — 2T and 

1 - lim lMjCg(G)1 = x 0 (T) = y 0 (T) = 1 - T (31) 

n—>• oo Tl 
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On the other hand, as long as rn - 1: 

Kt) = 


dy 

2/i 

r n^iiii 

+ 3:1 

dt 

xi + 2/1 

L \\Ax\\i\ 

Xl + 2/1 


xi \\A 2 y ||i 


\\Ay ||i 
yi P 2 ^lli 


,\\A 2 y 111 n -Xi - ||Az||i + x : 

v II A~.\\ " ' " 1 


l|Ac||i 


xi +y i HAylli xi +yi ||Ax||i 

^iP 2 y||i + yiH^ 2 ^lli 

{xi + yi)\\Ax\\! 




1 1 ( ''' - - W 2 (t)) -W 2 (t) 


+ 


+ 


y i 


an + 2/i 

Xl 

xi + 2/1 

2/i 

xi + 2/1 


[ - W 2 {t)(j)i n {t, w 2 (t))] + \\Ax\\iW 2 (t)(f>i n (t,W 2 (t)) 

( - 1)«/3(*)(1 - W 2 (t)) ~ W 2 (t) 

-W 2 (t)w 3 (t)] + \\Ax\\iW 2 {t)(j>i n (t,W 2 {t)), 


dt 


®in(t, 1 - WlO)) = 


Xi 


xi+yil \\Ay\\i 


(i+rrr 1 - 1 - wi(*))u>i(i) - 1 + «n(i) 


2/i 


xi + 2/1 
ah 

ail + 2/1 
2/i 

ah + 2/1 


- 1 +iui(t)0 in (t, 1 - wii(£))] - ||Ax||iii;i(t)0i n (i, 1 - iui(i)) 
_ 1 ) w i( t )( 1 “ w ^)) - 1 + wi(t) 

[ - (1 - wi (*))(! ^ ^ \\Ax\\iWi(t)<j) in (t, 1 - «/i(t)), 


— [/z(i)u/ 3 (t)(l - wi 2 (i))] 


/i(2)w 3 (2)(l - w 2 (t)) + y(t)w 3 (t)( 1 - W 2 (t)) - y(t)w 3 (t)w 2 (t) 
Xi\\A 2 y\\i+yi\\A 2 x\\i' 


(xi + yi)\\Ax\\ 


w 3 (t)( 1 - w 2 (i)) + \\Ax\\i [ti> 3 (t)(l - w 2 (t)) - w 3 (t)w 2 (t)]. 


The above equations imply: 

o dU{t) 


dt 

Xl 


xi + 2/1 

Xl 

xi + 2/1 

2/i 

aii +2/i 

2/i 

aii +2/i 


( ^ !)w3W(1 - w 2 {t)) - w 2 (i) 

(^pyjjr ~ 1 ) Wl _ Wi ^)) ~ 1 + Wl (*) 

(t+TTT " X ) Wl ~ M*)) - W4 (i) 

Pai||i 

( +4 if 1 _ 1 ) w s(i)(l - (*)) - 1 + w 3 (t) 

11 Ax 111 


2/i 


Xi + 2/i 


2/i 


aii + 2/1 

Xi 


[-w 2 (£)«+(£)] + \\Ax\\iw 2 (t)(f>i n (t,w 2 {t)) 

[-(1 - wi(t)){l - w 4 (t))} - \\Ax\\iWi{t)(/) in {t, 1 - 


xi + 2/1 

Xl 


aii + 2/1 


—xu 4 (£)it7i(t)] + ||A2/||iit)4(t)^ in (i,ty 4 (t)) 

-(1 - w 3 (t))(l - «J 2 (t))] - \\Ay\\iw 3 {t)(j)i n {t, 1 


+ (- 


xi||A 2 '2/||i + '2/i||^4 2 x||i 
(aii + yi)\\Ax\\i 


)w 3 {t){ 1 - «/ 2 (t)) + ||-4a;||i [w 3 (t)(l - w 2 (t)) - w 3 (t)w 2 (t)] 


+ (~ ' ri ^4 X ^ 1 ) u ’i( t )( 1 “ M*)) + ||j4-ar||i [wi(t)(l - w A {t)) - wi(t)w 4 {t)] 

(xi +y 1 )\\Ax\\i 


Wi(t)) 


w 3 (t)) 
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i.e. 


4 u ^ 


Xi 


Xi + y 1 

M 2 y\\ 


(lU 3 (f)(l - W 2 {t)) +'W 1 (t)(l - W 4 (t))) + “ 1 ) W 3(0( 1 - w 2 (0) - w 2(0 


Pf/|b 


Ps/lb 


+ ( I . 1 - - w 4 (t)) - 1 + Wi(t) -w 4 (t)w 4 (t) - (1 - W 3 (t))( 1 - w 2 {t)) 


\\M\ 


yi 


IP 2 


X 1 


xi+yi [ ||Aa:||i 


(w 3 (t)( 1 - W 2 (t)) +1t)l(f)(l - W 4 (f))) -W 2 (t)lV 3 (t) - (1 - 1«l(f))(l - 104(0) 


+( II |^||| II 1 1 - i)wji(*)( i - to 4 (t)) - w 4 (0 “ 1 ) W3 (0( 1 - ™ 2 {t)) ^ 1 + w 3 (0 

P^||l [lt> 2 (f)l0 3 (f) - 1Ul(f)(l - W 4 (0) +W 4 (t)w 4 (t) - W 3 (t)( 1 - 10 2 (0) 
to 3 (0(l - w 2 (0) - w 3 (t)w 2 (t) +toi(0(l - 104 (f)) - ioi(f)io 4 (t)] ■ 


Simplifying 


4 u < t) 




xi + y i 


^ y —(io 3 (f)(i - u> 2 (f)) + ioi(t)(i - to 4 (f))) +irr^rr ll ' u; 3(0( 1 - w 2 (f)) - w 3 (t){i - w 2 (0) 


IPylh 


IPi/lb 


-10 2 (f) + -^Pp-U>i(f)(l - 104(f)) - 10i(f)(l - 10 4 (f)) - 1 +10i(f) - w 4 (t)wi(t) - (1 - io 3 (t))(l - w 2 (t)) 


y 1 


IP 2 /II 1 

IPPII 


*1 +2/1 

, P 2 0li 


\\M 


— ( 103 (f)(1 - 102 (f)) +101 (f)(l - io 4 (f))) - io 2 (f)io 3 (f) - (1 - 1 Oi( 0)(1 - to 4 (0) 


II , I, 1Ol(0(l - 10 4 (f)) - 1Oi(0(l - 1O 4 (0) - 1O 4 (0 + | 103 (0(1 - 102 (0) - 1O 3 (0(l - 1O 2 (0) - 1 + 10 3 (0 

P*l|l P®l|l 


yields 


2-^-[/(f) = 3:1 [-2] 

dt X\ + 2/1 


1/1 


[-2] = -2 


zi + 2/i 

Again, since the set {t : m > 0} is of zero Lebesgue measure integrating both sides of U(t ) = —1 we have: 


1 - 2T = U(T) = U( 0) - T. 


So U* = [/(0) = 1 — T which is the desired result by ( |3T| . 
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